Search | WHO COVID-19 Research Database

Outbreak.info Research Library: A standardized, searchable platform to discover and explore COVID-19 resources (preprint)

Laura Hughes; Ginger Tsueng; Julia Mullen; Manar Alkuzweny; Marco Cano; Benjamin Rush; Emily Haag; Alaa Abdel Latif; Xinghua Zhou; Zhongchao Qian; Emory Hufbauer; Mark Zeller; Kristian Andersen; Chunlei Wu; Andrew Su; Karthik Gangavarapu.

researchsquare; 2022.

Preprint in English | PREPRINT-RESEARCHSQUARE | ID: ppzbmed-10.21203.rs.3.rs-1723808.v1

ABSTRACT

To combat the ongoing COVID-19 pandemic, scientists have been conducting research at breakneck speeds, producing over 52,000 peer-reviewed articles within the first year. To address the challenge in tracking the vast amount of new research located in separate repositories, we developed outbreak.info Research Library, a standardized, searchable interface of COVID-19 and SARS-CoV-2 resources. Unifying metadata from fourteen repositories, we assembled a collection of over 270,000 publications, clinical trials, datasets, protocols, and other resources as of May 2022. We used a rigorous schema to enforce consistency across different sources and resource types and linked related resources. Researchers can quickly search the latest research across data repositories, regardless of resource type or repository location, via a search interface, public API, and R package. Finally, we discuss the challenges inherent in combining metadata from scattered and heterogeneous resources and provide recommendations to streamline this process to aid scientific research.

Subject(s)

COVID-19

Outbreak.info genomic reports: scalable and dynamic surveillance of SARS-CoV-2 variants and mutations (preprint)

Laura Hughes; Karthik Gangavarapu; Alaa Abdel Latif; Julia Mullen; Manar Alkuzweny; Emory Hufbauer; Ginger Tsueng; Emily Haag; Mark Zeller; Christine Aceves; Karina Zaiets; Marco Cano; Jerry Zhou; Zhongchao Qian; Rachel Sattler; Nathaniel Matteson; Joshua Levy; Raphael Lee; Lucas Freitas; Sebastian Maurer-Stroh; Marc Suchard; Chunlei Wu; Andrew Su; Kristian Andersen.

researchsquare; 2022.

Preprint in English | PREPRINT-RESEARCHSQUARE | ID: ppzbmed-10.21203.rs.3.rs-1723829.v1

ABSTRACT

The emergence of SARS-CoV-2 variants of concern has prompted the need for near real-time genomic surveillance to inform public health interventions. In response to this need, the global scientific community, through unprecedented effort, has sequenced and shared over 11 million genomes through GISAID, as of May 2022. This extraordinarily high sampling rate provides a unique opportunity to track the evolution of the virus in near real-time. Here, we present outbreak.info, a platform that currently tracks over 40 million combinations of PANGO lineages and individual mutations, across over 7,000 locations, to provide insights for researchers, public health officials, and the general public. We describe the interpretable and opinionated visualizations in the variant and location focussed reports available in our web application, the pipelines that enable the scalable ingestion of heterogeneous sources of SARS-CoV-2 variant data, and the server infrastructure that enables widespread data dissemination via a high performance API that can be accessed using an R package. We present a case study that illustrates how outbreak.info can be used for genomic surveillance and as a hypothesis generation tool to understand the ongoing pandemic at varying geographic and temporal scales. With an emphasis on scalability, interactivity, interpretability, and reusability, outbreak.info provides a template to enable genomic surveillance at a global and localized scale.

Outbreak.info genomic reports: scalable and dynamic surveillance of SARS-CoV-2 variants and mutations (preprint)

Karthik Gangavarapu; Alaa Abdel Latiff; Julia L Mullen; Manar Alkuzweny; Emory Hufbauer; Ginger Tsueng; Emily Haag; Mark Zeller; Christine M Aceves; Karina Zaiets; Marco Cano; Jerry Zhou; Zhongchao Qian; Rachel Sattler; Nathaniel L Matteson; Joshua I Levy; Marc A Suchard; Chunlei Wu; Andrew I Su; Kristian G Andersen; Laura D Hughes.

medrxiv; 2022.

Preprint in English | medRxiv | ID: ppzbmed-10.1101.2022.01.27.22269965

ABSTRACT

The emergence of SARS-CoV-2 variants has prompted the need for near real-time genomic surveillance to inform public health interventions. In response to this need, the global scientific community, through unprecedented effort, has sequenced over 7 million genomes as of December 2021. The extraordinarily high sampling rate provides a unique opportunity to track the evolution of the virus in near real-time. Here, we present outbreak.info, a platform that can be used to track over 40 million combinations of PANGO lineages and individual mutations, across over 7,000 locations, to provide insights for researchers, public health officials, and the general public. We describe the data pipelines that enable the scalable ingestion and standardization of heterogeneous data on SARS-CoV-2 variants, the server infrastructure that enables the dissemination of the processed data, and the client-side applications that provide intuitive visualizations of the underlying data.

Schema Playground: A tool for authoring, extending, and using metadata schemas to improve FAIRness of biomedical data (preprint)

Marco Cano; Ginger Tsueng; Xinghua Zhou; Laura Hughes; Julia Mullen; Jiwen Xin; Andrew Su; Chunlei Wu.

biorxiv; 2021.

Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2021.09.02.458726

ABSTRACT

Background Biomedical researchers are strongly encouraged to make their research outputs more Findable, Accessible, Interoperable, and Reusable (FAIR). While many biomedical research outputs are more readily accessible through open data efforts, finding relevant outputs remains a significant challenge. Schema.org is a metadata vocabulary standardization project that enables web content creators to make their content more FAIR. Leveraging schema.org could benefit biomedical research resource providers, but it can be challenging to apply schema.org standards to biomedical research outputs. We created an online browser-based tool that empowers researchers and repository developers to utilize schema.org or other biomedical schema projects. Results Our browser-based tool includes features which can help address many of the barriers towards schema.org -compliance such as: The ability to easily browse for relevant schema.org classes, the ability to extend and customize a class to be more suitable for biomedical research outputs, the ability to create data validation to ensure adherence of a research output to a customized class, and the ability to register a custom class to our schema registry enabling others to search and re-use it. We demonstrate the use of our tool with the creation of the Outbreak.info schema—a large multi-class schema for harmonizing various COVID-19 related resources. Conclusions We have created a browser-based tool to empower biomedical research resource providers to leverage schema.org classes to make their research outputs more FAIR.

Subject(s)

COVID-19

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL